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1.   Introduction 

The  purpose  of  this  paper  is  to  discuss  the  estimation  of 
exact  aggregation  models  by  the  nonlinear  three  stage  least  squares 
method  (NL3SLS)  of  Amemiya  (1977),  Gallant  (1977),  Gallant  and  Jorgenson 
(1979),  Jorgenson  and  Laffont  (1974)  and  others.   In  exact  aggregation 
models  there  is  a  unique  correspondence  between  individual  behavior  and 
aggregate  behavior.   This  property,  known  as  recoverability ,  makes  exact 
aggregation  models  appropriate  for  the  analysis  of  individual  data, 
average  data  or  both  in  combination.   In  this  paper  we  consider  estima- 
tion using  both  cross  section  data  on  individuals  and  average  time 
series  data.   We  present  the  estimator,  discuss  its  properties,  demon- 
strate its  similarity  to  linear  pooling  estimators,  and  indicate  its 
computational  advantages  over  other  nonlinear  estimators. 

The  structural  form  of  exact  aggregation  models  for  individual 
agents  is 

Yikt   =   Xkt  6i  (V  6)  (i  =  1,  ....  q) 

where  k  indexes  agents,  t  indexes  time  periods,  y   ,  i  =  1,  ...,  q  are 

lKt 

q  dependent  variables,  x^  =  (x^,  ...,  x  )  is  a  Q  vector  of  pre- 
dictor variables  which  may  vary  over  both  agents  and  time,  and  p  is  a 
vector  of  variables  which  vary  over  time,  but  are  constant  across  agents 


074494* 


in  a  given  time  period.   The  Q  vector  of  coefficients  &      (pt>  6),  is  a 
function  of  p   and  an  L  vector  of  structural  parameters,  8'  =  (6   ...,  Q^) , 
for  i  =  1,  ...,  q.   All  restrictions,  including  cross  equation  and  exclu- 
sion restrictions  are  embodied  in  the  form  of  g^  (p  ,  9),  i  =  1,  ...,  q. 

We  write  the  system  of  structural  equations  for  each  individual 
in  vector  format  as: 

*kt   =   (Iq  ®    ^   B  <V  6)  (1) 


where: 


ykt   =   (ylkt'  •'•'  W' 


b  (pt,  e)'    =    (^  (pt,  e)\    •  •>  eq  (pt.  e)')'  and  xq  is  the  identity 

matrix  of  order  q.   By  averaging  (1)  over  agents  for  each  time  period, 
we  obtain  the  structural  model  for  averaged  data  as: 


yt  =   (Iq  ®  *t}  B  (V  e) 


(2) 


where  y   and  x   are  q  and  Q  vectors  of  averages  of  ykJ.  and  xfct  respec- 
tively. 

The  recoverability  property  is  reflected  in  the  fact  that  (1) 
and  (2)  display  the  same  parameter  vector  6  and  coefficient  vector 
g  (p  ,6).   Moreover,  the  forms  (1)  and  (2)  are  necessary  and  sufficient 
for  the  recoverability  property,  if  the  population  distribution  of  xkt 
is  unrestricted  (c.f.  Lau  (1980)). 


Many  examples  of  exact  aggregation  models  can  be  found  in  the 

literature.   The  simplest  structural  form,  which  underlies  most  discus- 

2 
sions  of  pooling  time  series  and  cross  section  data,   is  the  linear 

model. 

yikt  =  Pt  0li  +  *kt  92i  (1  =  l'  ••"  q) 

where  8    and  6    are  vectors  of  constant  parameters.   The  vector  9  of 
(1)  and  (2)  consists  of  the  elements  of  8   ,  9 „  ,  i  =  1,  ...,  q, 

For  illustration  of  more  general  exact  aggregation  structures, 
we  present  some  examples  from  demand  analysis.  All  examples  of  the 
Gorman  Polar  Form  are  exact  aggregation  models  —  the  linear  expenditure 
system  introduced  by  Stone  (1954)  and  the  various  S-branch  utility  models 
such  as  those  in  Brown  and  Helen  (1972)  and  Blackorby,  Boyce  and  Russell 
(1978).  In  exact  aggregation  form,  each  equation  of  the  linear  expendi- 
dure  system  appears  as: 

yikt  =  (pit  Ci  "  bi  \  ci   PU>  +  bi  \t 

where  y..   denotes  expenditure  on  good  i  by  family  k  in  period  t,  M^ 
denotes  total  expenditure  and  p'  =  (Plt,  •••»  P  fc)  is  the  vector  of 
(common)  prices  at  time  t.  The  vector  9  consists  of  the  parameters  c, , 
...,  c  and  b1 ,  ...,  b  ,  and  x'   =  (1,  M.,.)- 

Of  more  recent  interest  are  the  AIDS  system  of  Deaton  and 
Muellbauer  (1980)  and  the  translog  model  of  Jorgenson,  Lau  and  Stoker 


(1982).   In  our  format,  the  AIDS  system  appears  as: 


^ikt   =    (ai  +  rci.i    *np.it)Mkt+inHr-  \t£nMkt 


where  v.,  ,  M,   and  p   are  as  above,  and  Jn  P   =a  +  !!  a.  In  p,   + 
lkt   kt      t  t    o      £     it 

-~  I  I  c.,  In  pn^    in  p.^.   The  vector  0  consists  of  the  para- 
l  £j      it     j t 

meters  a,,  ...,  a  ,  b,  ,  ....  h  ,  and  c,,,  ...,  c   ,  and  x.1   =  (M,  ,  M, 

l      q   1      q      il      qq     kt    kt   kt 

Pn  M.  ) .   The  translog  model  of  exact  aggregation  appears  as: 
a .  +  E  b .  .  in  p  . 

yikt =  ( — dtf^ >  Mkt  -  mp7  \t in  \t 


Hb.AR      Au 
is     let      skt 

S   D(pt) 
where  D(pt>  =  -1  +  Z   b   in  p.t;  Ylkt.  \t   and  pfc  as  before,  and  Agkt> 

i 

s  =  1,  ...,  S  denotes  S  demographic  variables  such  as  family  size,  age 

of  head,  etc.   The  vector  6  consists  of  the  parameters  a-,  ...,  a  ,  b, , , 

1         q    11 

....  bqq,  b*r  ....  bjs,  and  bm,  ...,  bMq,  and  x^  =  (M^,  M^  in   M^, 
M.   A   ,  ...,  M   A   ).   In  each  of  these  examples,  the  theory  of 

ICC    J.KC  Kt   SKl 

consumer  demand  places  contraints  on  the  parameters  of  the  model,  which 
reduces  the  size  of  the  parameter  vector  6 . 

In  the  next  section  we  formally  present  the  model  and  stochastic 
structure,  as  well  as  the  assumptions  required  for  estimation.   We  then 
present  and  characterize  the  NL3SLS  estimator.   We  conclude  with  an 
empirical  example  involving  substitution  among  different  types  of  energy 
in  the  United  States. 


2.   Stochastic  Structure  and  Instrumental  Variables 


We  begin  by  assuming  that  t  =  1,  ...,  T  denotes  the  time  peri- 
ods under  consideration,  with  K  the  size  of  the  population  at  time  t. 
For  each  t  we  observe  the  vector  p   and  the  averages 


(t  =  1 T) 


In  addition,  at  time  t  we  observe  a  cross  section  data  set  of  size 
'  o 

N  <  K   :  i.e.  we  observe  y,    and  x,t   for  k  =  1,  .  .  .  ,  N. 
t  kt       kt 

o  o        o 

For  individual  behavior,  we  assume  that  the  model  (1)  is  cor- 
rect up  to  a  stochastic  component  e   .   The  model  representing  the  cross 
section  data  is  then: 


?ykt 

?   Xkt 

k 

k 

't    =      K 

'    At           K 

t 

t 

ykt  =  (Iq©Xkt  }  6  (Pt  '  6*}  +  Ekt  '   (k  "  X*  •••'  N)* 


(1') 


where  9*  denotes  the  true  value  of  9  to  be  estimated.   We  can  obtain  the 
stochastic  version  of  model  (2)  by  averaging  (1*).   This  is  the  appropriate 
procedure  if  the  average  data  are  constructed  by  averaging  across  all  indi- 
vidual observations.   Often,  however,  this  is  not  the  case.   For  example, 
in  consumer  demand  analysis,  data  on  aggregate  expenditures  are  obtained 
from  production  accounts  for  the  economy  as  a  whole,  rather  than  by  direct 
observation  of  quantities  consumed  by  households.   To  allow  for  errors  of 
observation  in  the  average  data,  we  introduce  an  additional  stochastic  term 


v,   for  each  individual.   The  model  relating  y  to  x  and  p  is  then: 
kt  b  3 t  t     Kt 

yt  =  (I   0  x')  6  (pt,  6*)  +  ut,  (t  =  1,  ....  T)  (2*) 

where  u   =  Z  e.   /  K  +  I  v   /  K  . 

L       ,    Kl      l      _Kl      l 

k  k 

We  make  the  following  assumption  about  the  errors  e,   and  v,  : 

kt      kt 


Assumption  1:   (Stochastic  Structure) 

e,   is  distributed  with  mean  0  and  positive  definite 

covariance  matrix  ft  .   e,   and  e,  .  ,  are  independent 

e.  kt      k' t 

for  k  4   k'  or  t  f   t'.  v,   is  distributed  with  mean  0 

kt 

and  positive  definite  covariance  matrix  £5  .  v    is 
r  v    kt 

distributed  independently  of  v     for  k1  /  k  or 

t'  4   t   and  of  t,  ,  , ,  for  all  k' ,  t'. 
k  t 

For  the  moment,  we  assume  that  ft  and  ft  are  known;  later  we  replace 
these  parameters  by  consistent  estimators. 

From  Assumption  1  we  have  immediately 
that: 


E  (utu;,)  =£   (fte  +  ftv)  ,  t  =  t',  (3) 

=  0  ,  otherwise  , 


and  : 


E  (ekt  UP  =  V  %   .  t  =  to  , 

o  t 


=  0  ,  otherwise 


(4) 


We  denote  P.  +  n  by  fl  . 
e    v   '   u 

To  consider  estimation  of  the  complete  model,  we  stack  the 
equations  (1')  representing  the  cross  section  as 

Y  =  (I  @  x)  3  (pt  ,  9*)  +  e  ,  (5) 

o 

where  Y'  ■  (ym  ,  y^   ,  ...,  ym  ,  y21t   ,  ...  ,  y2Nt  ,  ....  y    ) 

o      o  o      o  o  o 

is  the  Nq  by  1  matrix  of  dependent  variables,  X  is  the  N  by  Q  matrix  with 

kth  row  x'   ,  and  c  is  the  Nq  by  1  matrix  of  disturbances,  distributed 

Kt 
O 

with  mean  0  and  covariance  matrix  fi  @)    I  .   We  represent  the  aggregate 
equations  (2')  as 

Y  =   f  (9*)  +  u  .  (6) 

wnere  i      11*   12'  *  *  * '   IT*   21*  "**•   ?t '  *  *  * '  a  1 '  " * " '   aT 

is  the  Tq  by  1  matrix  of  average  dependent  variables,  f(9)'  =  (x'  8.(p.,  9*), 

x2  81(p2,  8*),  ...,  5j,  61(pT,  9*),  x^  62(Pl,  8*),  ....  x^  6q(pr  6*)) 
is  the  Tq  x  1  matrix  of  structural  terms,  and  u  is  the  Tq  by  1  matrix  of 
disturbances,  distributed  with  mean  0  and  covariance  matrix 

Qu   ®  P'P'  where  P  =  diag  (1/*^i 1/^). 

The  model  describing  all  of  the  data  jointly  is  obtained  by 

combining  (5)  and  (6),  taking  into  account  the  covariances  (4).   To  faci- 
litate discussion  of  the  pooling  of  time  series  and  cross  section  data, 


it  is  convenient  to  transform  (5)  and  (6)  to  eliminate  heteroscedasticity 
over  time  periods  and  to  produce  zero  correlation  between  individual  and 
average  disturbances. 

Heteroscedasticity  is  removed  by  transforming  (6)  by  I   @  P~  , 
where  P  '  =  diag  (v^T..,  ...,  vIL,) ,  a  standard  population  size  correction,  as  in: 

(I   ©  P"1)  Y   =   (10  P"1)  f  (6*)  +  (I   0  P"1)  u  ,    (7) 

where  (I  (x)    P   )  u  has  covariance  matrix  Q      (x)  I„. 
q  v— '  u  ^-s       I 

The  individual-aggregate  correlations  are  removed  by  a  nonsingular  trans- 
formation of  (5)  and  (7),  which  is  equivalent  to  replacing  v^K   y   ,  >^K   x 

o   o     on 

and  >^K   u   in  (7)  by  ^   y*  ,  v^K   x*  and  Jk       u*  ,  where 

tt  -'tt'tt  tt' 

oo  oooo  oo 

y*       -    Q  (yt     -   jP-    ycs)  , 

o  o  t 

o 

*t  -      Q    (*t        -     IT     *cs>    '  (8) 

o  o  t 

o 

u*  =     Q    (Ut        -     JL     ics)         . 

o  o  t 

o 


y      ,   x       and   e        represent   the   cross   section  averages  of  y,    x  and   e, 
-'cs'      cs  cs        v 

and  Q  satisfies: 

Q  (n  -(n/k  )n  )  q*  =  n  (9) 

u      t   e        u 
o 

where  Q  can  be  determined  uniquely  from  consistent  estimators  of  £2  and  fi  . 

After  applying  both  corrections,  we  stack  the  individual  model  (5)  and 


the  transformed  aggregate  model  as; 


=  (I  0  X)  s  (pt  ,  e*)  +  e  , 


(10) 


y*  =  f*  (e*)  +  u*  , 


and  denote  the  full  system  as; 


Y   =   f  (6*)  +  U  , 


(11) 


where  U'  =  (e',  u*'),  which  is  distributed  with  mean  0  and  covariance 
matrix 


T.   =  E(U'U)   = 


"   ©  h 


0 


"u  ©  T1 


The  existence  of  a  transformation  which  separates  cross  section 

and  average  time  series  disturbances,  such  as  (8),  does  not  require  the 

3 
specific  stochastic  assumptions  employed  here.    However,  a  few  remarks 

on  the  rplfvance  of  the  correction  (8)  are  in  order.   First,  ignoring  the 
correction  (8)  in  the  NL3SLS  procedure  amounts  to  ignoring  the  individual- 
aggregate  disturbance  covariances,  which  affects  only  the  efficiency  of 
the  estimator  and  not  its  consistency.   Second,  the  importance  of  the  cor- 
rection depends  on  the  size  of  the  cross  section  relative  to  population 

size.   In  many  applications,  such  as  the  example  given  below,  N/K   will 

o 
be  extremely  small,  with  the  correction  leaving  the  data  unaffected.   Typi- 
cal numbers  for  an  analysis  of  U.S.  household  demand  behavior  are  N  =  10,000 


and  K 


70  million  in  1972.   Only  when  the  cross  section  sample  size  is 


10 


of  the  same  order  of  magnitude  as  the  population  size  will  the  correction 
yield  significant  benefits;  otherwise  it  can  be  ignored. 

Before  presenting  the  additional  assumptions  required  for  esti- 
mation of  the  complete  model,  we  introduce  instrumental  variables.   It  is 
often  appropriate  to  treat  some  of  the  predictor  variables  x   and  p  as 
endogenous  for  the  individual  observations,  the  aggregate  observations, 
or  both.   This  can  occur  when  the  model  is  a  simultaneous  equations  model 
in  exact  aggregation  form  or  part  of  a  larger  system  of  simultaneous  equa- 
tions.  For  example,  in  demand  analysis  the  aggregate  data  on  prices  can 
reflect  both  supply  and  demand  influences,  requiring  aggregate  instruments, 
with  individual  total  expenditure  and  demographic  variables  exogenous. 
Alternatively,  in  a  study  of  savings,  errors  in  variables  may  necessitate 
instruments  for  the  individual  data,  while  in  the  average  data  such  errors 
may  be  negligible. 

We  assume  that  there  are  s-vectors  z,  _   ,k=l,  . ..,  N  used 

kt 
o 

for  instrumenting  the  individual  data  and  s-vectors  z  ,  t  =  1,  ...,  T 

used  for  instrumenting  the  average  data.   Denote  as  Z  and  Z  the  matrices 

with  rows  z,1   and  z  respectivelv,  and  as  Z  the  matrix: 
kt      t 


Z  = 


I   ®  Z 
q   w   o 


Iq  ©  Z 


Fo 
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Following  Gallant  (1977),  we  make  the  following  assumptions: 
Assumption  2:   The  parameter  space  of  6,  say  0,  is  compact, 
with  the  true  value  an  interior  point. 

Assumption  3:   The  components  of  6 .  (p  ,  9) ,  i  =  1,  . . . ,  q, 
are  twice  continuously  dif ferentiable  in  6.,  j  =  1,  . ..,  L. 

r  the  next  two  assumptions,  we  use  the  notation  &]    (p  ,  0)  and  8.   (p  .  9) 


to  refer  to  the  vectors: 


i  3  6il  3  BiQ 

6i    (pf  6)    =    (y~6~_  '    ••"   'fTTy  ' 


2  2 

3  6.n         3  8.n 

Pi   (Pt'  y;   "   (390  39.  '  •••'  39n  39. ;  ' 


wnere  6.   is  tne  mth  component  of  8.  (p   9). 


Assumption  4A  (Cross  Section). 

The  matrix  —  Z  'Z  converges  to  a  positive  definite  matrix  as 
N  o  o 

N  -*■   °°.   The  Cesaro  sums, 


n!  (^ikt  -xkt   h  (pt  ■  6))(^kt  ~Kt   6j  (pt  •  e»  ■ 

o  o  o  Jo  0  ° 


h  zkt  (^kt-xkt  Bi  (pt  ■ e))  • 

K  o  o 


FkEzkt  (xkt  6i£  (pt ■  e». 

K  o  o  o 
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converge  almost  surely  uniformly  in  9,  for  all  i,  j  =  1,  ..., 

:.    =r.i    ;.   =   1,    .  .  .  ,    L.      The   sums, 


1   _ 


sui  z   .. 


N  ■:■      ~rl  s'k 

K  o  o 


t  (xkt  '  £i£  (?t  ■  e)  ' 


—  =     2s-kt  (xkt  ■   si£  (pt .  B»l. 


o  o  o 


are  bounded  almost  surely  for  all  i  =  l,  ...,q;j,  £.  =  1,  ..., 
"_  and  s'  =  1,  ...,  s,  where  z  ,    is  the  s'th  component  of  z 


Assumption  AB  (Time  Series) . 

The  matrix  —  Z'Z  converges  to  a  positive  definite  matrix 

2S  I  +  ■»,   The  Cesaro  sums, 


|iKt  (yit-x;  s.  (pt,  9))  (yjt-x;B.  (pt,  e)) 


k\***t  (>'it-^Bi  <V  6))' 


|^tit  (x;  4  Cpt,  e))  , 


converge  almost  surely  in  6,  for  all  i,  j  =  1,  ...,  q  and 
4=1,  . . . ,  L.   The  sums , 


13 


^:sc?s   is,t  (x;     pt.  B))|  , 


T  t 


izs«pe  '  is,t  u;   5.1"  (Pt,  8))L 


t 
are  bounded  almost  surely  for  all  i=l,  ...,  q  j,  1=1,    .. 

L  and  s'  =  1,  ...,  I,  where  zw      is  the  s'th  component  ::  z_. 

Assumption  5:   The  matrix 


;--     — - —  (M*  +  >'*)  , 
N  +  I   xo    x'  ' 
N,T  -*•  « 

is  nonsingular, 

where  M   and  M  are  defined  In  equation  (16)  c:  the  next  ;e: 
xo      x 


Assumption  6:   (Identification). 

:£  (11)  is  identified  by  the  instrumental  variables 

z    and  z  ;  i.e.  the  onlv  solution  of  the  alnost  sure 
kt       t' 
o 

limits 


»  -*■  as    N  K       O 


l2a: 


(i  =  1 -/ 


Y  l     v^t  *t  (yit  "  5t  £i  (pt'  e))  =  ° 

T  -*■  <=     t 

(i  =  1,  ...,  q) 

is  the  true  value  9  . 


:::- 
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Assurr-tirr.-  1  art  ]  :;::;  =  ;-.:  standard  re :  .irerer.ts  of  -.on- 
linear  ridels.   Ass .-' n :- 5  -A,  -1    and  :  retlaie  the  usual  large  sarrle 
desitr  !=::::■:  ass_rrti:rs  ::  lirear  thetry.   A5sur.rr.c-  t  is  ar.alo- 
;:-=  r:  '.--    _s_al  Jefinitiot  ::  i_s  :r  __er:al  variables  for  lir.ear 

-  :  -  -  1  r 

a    -  plieitly  separated  the  ass_rrti:h5  :r  the  :r:;s  =e:- 
titr  rcdel  ar  i    tire  series  rrrel  ir.  :rier  t:  facilitate  the  distussicr  of 
se:ere:e  HL3SLS  estimators  for  e;:   cars  set.   A  rsra_eter  -.  is  ide_tified  in 
the  :r:;s  settitr.  tttdel  iff  ir.  the  stlution  ::  the  almost  sure  lirit 
11a   "   is  necessarily  e:_a_  t:  E  .' r.  the  true  value.   Sinilarlv,  a 
iirsreter  is  identified  ir.  the  tire  series  If  it  is  uniquely  determined 
:    111  .   We  i:llett  all  rarareters  idertified  ir  the  ;r:ss  settitr.  ir 
a  vector  B  ,  all  rarareters  identified  ir  the  tine  series  ir.  a  vector  r. 
art  all  if  the  renairiri  tarareters  ir.  a  vector  '-' . 


3. 


rhe   BL3SLS    e;::r;::r    r    ::      "    l;    found    a;    the      al-e    ::    -    siiid 


miniiaizej 


e     a)     = 


(e))'   [i  : 


__    .    -  V      _       .        1 


:: 


-•  -.era 


%    © 


u   v> 


is  a  ;::;:;:£-:  £:::r£::r  ::'  I  as  S,  I 
explicitly  ;.- 


5   B    can  be   writter  =:ra 


S  (8)   =  S^  (6)   +  S  (9), 
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m.  th : 


-:~~  a •  =  -:  -  :.  5    f  (ct  ,  ;    ■;-  ^       ~_     :  I   --  -.   - 


s(9)  =  (y*  -  f*  (8))'[sy  0  :  ::"-;;  t*  -  f* 


--ere  clearly  5"    and  S  7   ere  NL3SLS  objective  functions  fox  the  cross 

section  and  average  nodels      idually  could  1>«  ■   "- 

~Lzez    z:    esrirate  7  ,    for  fixed  values  of  the  regaining  rararete:- 
similarly,  5    could  re  niririzei  to  estiaate  B.   If  6  *  6  =  f .  then  all 

tara-eters  could  re  estimated  fros  :::;:  data  ee:   minimizing   1-   ::-- 
strair.s  the  estirated  values  froa  cross  sectier  ar.d  tire  settee  data 
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sets  to  be  equal,  which  results  in  efficiency  gains.   The  advantage  froir 

pooling  lies  in  obtaining  more  efficient  estimates. 

Note  that  S  (6)  can  be  evaluated  using  only  ft 

and  the  moment  matrices  Z1  X,  Z'Z   and  (I   (x)  Z  ' )  Y.   Thus  for  estimating 

o     o  o       q  y~-'    o 

9  or  other  more  restricted  parameterizations  of  B(p  ,  9),  only  one  pass 
through  the  cross  section  data  is  required  to  construct  these  moments. 
This  is  the  main  computational  advantage  provided  by  exact 
aggregation  models. 

The  estimation  procedure  consists  of  three  steps:   First,  find 
consistent  estimators  of  ft  and  ft  ;  second,  minimize  (12)  to  obtain  6: 

E        U  ' 

third,  calculate  the  asymptotic  covariance  matrix  of  9.   If  9   is 
not  empty,  then  we  cannot  improve  upon  previous  suggestions  in  the  lit- 
erature for  finding  consistent  estimators  of  ft  and  ft  :  for  example, 

e      u 

Gallant  (1977)  suggests  estimating  each  equation  of  the  model  by 

NL2SLS.   This  involves  pooling  both  data  sources  on  a  single  equation 

basis  ,  forming  ft  as  the  estimated  residual  covariance  from  the  cross 

section  data,  and  forming  ft  as  the  estimated  residual  covariance  from 
'  u 

the  average  time  series  data. 

The  more  usual  situation  is  that  9   is  empty,  which  suggests 

a  simpler  procedure.   First,  obtain  consistent  estimates  of  6(p   ,  9)  by 

o 
(linear)  2SLS  estimation  of  each  equation  using  the  cross  section  data. 

The  estimated  residual  covariance  matrix  ft  provides  a  consistent  esti- 

E 

mator  of  ft    even  if  9,  is  not  empty.    Using  the  consistent  estimators 
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of  6(pt  ,  G),  solve  for  consistent  estimates  of  6°,  say  6°.   Holding  9° 
o 

fixed  at  0  ,  estimate  the  remaining  parameters  of  8  by  applying  NL2SLS  to 
each  equation  of  the  model  or  NL3SLS  to  the  system  as  a  whole,  using  only 
the  time  series  data.   The  estimated  covariance  matrix  of  the  NL2SLS  residu- 
als, fi  provides  a  consistent  estimator  of  fi  .   In  addition,  this  procedure 
u  u  '       r 

usuallv  produces  eood  starting  values  for  6  to  use  in  minimizing  (12). 

The  objective  function  (12)  can  be  minimized  using  a  variety  °f  well 

known  computational  methods.   A  convenient  one  that  illustrates  pooling  cross 

section  and  time  series  data  is  the  Gauss-Newton  process.   To  discuss  this 

method  we  require  the  following  notation:  Let  B  (e)  denote  the  matrix: 

o 


Bo(6)   = 


Pol  <6>     1 


I   > 


oq         J 


where  BQi  is  the  Q  by  L  matrix  with  jth  column  g^  (p   ,  6)  ,  i 

o 
j  =  1,  . . . ,  L.   Let  ty   (0)  denote  the  matrix: 


1, 


,  q, 


*  (e)  = 


*   (6) 


*q  <P> 


where  ij^  (0)  is  the  T  by  L  matrix  with  t,j  element  /i<  x'  6^  (p  ,  0),  for 
i=l,  --..q,  j  =  1,  ...,L  and  t  =  1,  ...,  T. 
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The  Gauss-Newton  process  is  an  iterative  procedure  for  finding 

9  from  an  initial  value  6,.   At  the  ith  iteration,  the  current  value  9. 

1  l 

is  updated  to  9    =  9.  +  A  9.  by  first  linearizing  the  system  (11)  with 
respect  to  9  as: 


Y  -  (I   ©  X)  8  (p   ,  9.)  ~   (I   ®  X)   pQ  (9.)  A9.  +  E  , 

o 

(14) 
Y*  -  f*  (9.)^  i>    (6.)  Ae-  +  u 


We  then  apply  Zellner  and  Theil's  (1962)  linear  three  stage  least  squares 
method  to  the  model  (14),  obtaining: 


A  9.   =   (M   +K)1(M   +  M  )  ,  (15) 

l       xo    x      eo    u 


where : 

M   =  B  (9.)'  (Q_1  fx)  X'Z   (Z  'Z  )_1  Z'  X)  B   (9.) 

xo   >o  i  e  v— '    o   o  o     o    ro   i 

M   =  B  (9.)'  (fi_1  (x)  X'Z   (Z  'Z  )_1Z')  (Y-(I   (x)  X)  6  (p^  ,  9.)) 

eo   "o      l  e  ^    ooo    o       q^^        t    i 

Mx  =  ^  (9\)'  (ft"1  ©  ZCZ'Z)"1!')  *  (6i) 

M   =  ii   (9V  (n_1  (x)  Kz'z)"1!')  (?*-f*  (0,))  . 
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Convergence  to  6  is  achieved  when  A  6.  becomes  sufficiently  small.   In 
practice,  however,  we  have  found  Hartley's  (1961)  suggestion  useful, 
namely,  we  check  whether  S(9.  +  A  6.)<  S(9.)  ;  if  not,    we  shrink  1   r_. 
by  forming  9    =  9.  +  A  6./2.   Checking  continues  until  either  improve- 
ment in  S  is  found,  at  which  point  a  new  iteration  is  performed,  or,  if 
the  current  increment  falls  below  a  convergence  criterion, 

we  have  found  9=6.. 

l 

Under  our  assumptions  the  NL3SLS  estimator  5  is  consistent  for 
9*  as  N,  T  ■*  <=,  and  asymptotically  normal  with  asymptotic  covariance 
matrix: 


AVAR  (6)   =   (M*  +  M*)"1  ,  (16) 

xo    x 


where  the  *'s  indicate  (15)  evaluated  with  the  true  values  H  and  .". 
(c.f.  Gallant  (1977)).   The  precise  form  of  the  limiting  normal  distri- 
bution depends  on  the  way  that  N  and  T  approach  infinity;  however,  a 
consistent  estimator  of  AVAR  (6)  is  (M   +  M  )~   in  any  case,  so  this 
problem  is  of  secondary  concern. 

Several  points  of  interest  arise  from  a  closer  inspection  of 
(15).   First,  the  relation  to  linear  pooling  estimators  is  apparent;  for  example, 
if  6°  =  9  =  9,  then: 


r.  v-1 


A  6.  =  (M   +  M  )~x    (M   A  6"  +  M  A  6.)  , 
l      xo    x'      xo     l     x     1 


where  A  6.  and  A  9.  are  the  Gauss-Newton  increments  from  minimizing 
l        l 


:: 


=       . .- :     ; 


=     ■ 


'-  i       -      I  -  .  I  '   .  i      'IT      :.~r 


V 


■  - 


-_■-    ---  -  --------- 


-     - 


-■ 


- 

--------- 

-  - 


------ 

--z   -  ' 

.      -  \ 

.  -  i       I'  '- 
,_--:       --------       Z*       U 

OTU     0*    -  '), 


u 


r-=    :.:::-    |    :       Ls 


ilch  satisfies  g(e      =   t1 


:    :; 


.      ::=::::.:£:    ::r    i_. 


_-    -  -       -     -         -  -  -    - 


5r(p)  =  (Y  -  *(g(c  : 


:;:-.:; 


- 


tions      ; .:       Gallant    and    ;;::£-;-       1:";         --:er    :'-  =    -.'-'.    -    :: 


-    =    5_    :       -    ?    r     . 


' 


::~ 


)CL  . — Ls. 


.    '    -  - 


r-it£    :£5t    statistic    is    pro    idek    -     r. 

•-- .-  ---..--    of  S      :      to    tiod    :    is    analogous    to    the 
og    f.    aE  -T£;    r - 1  ■     ~;-i~~   as t r i c e =    :r;~    r. ~ =    ;: 

£  ~    .1 "  sis 
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of  Z  can  be  used  in  evaluating  (17),  the  desirable  monotonicity  condition 
S  (p)  -  S(5)  >  0  will  be  guaranteed  if  the  same  I  is  used  to  evaluate 

both  S  and  S.   Thus,  the  original  consistent  estimates  ft  and  ft  used 

r  e      u 

in  estimating  9  should  be  used  in  finding  estimators  for  restricted  ver- 
sions of  the  model. 

The  final  topic  we  consider  is  the  estimation  of  9*  subject  to 
inequality  restrictions.   For  example,  in  demand  analysis,  an  integrable 
demand  system  must  obey  the  condition  that  the  Slutsky  matrix  of  compen- 
sated price  derivatives  is  nonpositive  definite.   The  unconstrained  estimator 
9  need  not  obey  such  restrictions  for  finite  samples:  thus,  it 
may  be  desirable  to  impose  them.   We  represent  such  restrictions  for- 
mally as 

*   (6)  >  0  ,  m  =  1,  ....  M'  ,  (19) 

m     — 

where  we  assume  <£   to  be  twice  continuously  dif ferentiable  in  each  com- 
m  J 

ponent  of  9. 

The  inequality  constrained  estimator  9  minimizes  S(9)  subject  to 

the  constraints  (19).   This  estimator  corresponds  to  a  saddlepoint  of  the 
Laeraneian  function: 

f,  =   S(9)  +  A'<J>  ,  (20) 

where  X  is  a  vector  of  M'  Lagrange  multipliers  and  $   is  the  M' vector  of 
constraint  functions.   The  Kuhn- Tucker  (1951)  conditions  for  a  saddle- 
point  of  this  Lagrangian  are: 

Ve£  =   v0  S(9)  +  V    ($(9))   =   0  , 
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and  the  complementary  slackness  condition: 
X'<$>  =   0  ,  X  >_  0  , 

- 

r  :  . 

- 

where  *(8)  is  the  M'  byL  matrix  with  i,  j  element  .  .~  . 

;  1 

To  obtain  the  estimator  E  we  begin  by  linearizing  the  model  as 

in  (14).   Next,  we  linearize  the  constraints  as 


c(e.+1)  =  i   (e.)  i  ai  +  i  (5.)  , 


where  6   is  the  current  iteration  value  of  the  unknown  Darameters.   He 
i 

then  apply  Liew's  (1976)  inequality  constrained  linear  three  s:age  lea; 
squares  method  to  the  linear  model,  obtaining: 


a"e   =  &V  +  Cm   +  m  )_1  i   (e.)'  x*  , 

i        l       x:     x        i 
where  &  0.  is  giver,  by  (15)  and  \*   is  the  solution  of  the  linear  complc 


mentarity  problem 


i    (5.)   (M    +  M  )_1i  (9.)*  X  +  f   B    &  6   -  ♦  (9  )  >  0  , 

l      x:     x        l  -      i        -   — 


where 


i   (e\)  [m  *  S  r1  i  5.)f  x  +  [i  a  ]  &e  -  t  (e  )]'  •  =  :. 

1     XO     X'         1  x     x         x 


X  >   0 
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Given  6   that  satisfies  the  constraints  (19),  we  update  to  6    = 

6.  +  A  9.  and  check  that  both  S(e...)  <  S  (6,),  and  that  <j>   (6.,,)  >  0, 
i      i  l+l        l  m   i+i  —  ' 

m  =  1,  ...,  M'.   If  not,  we  shrink  the  increment  vector  as  before,  until 
either  improvement  is  found  or  the  increment  values  fall  in  absolute 
value  below  a  convergence  criterion.   This  concludes  our  discussion  of 
the  NL3SLS  estimator. 
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4.   Example:   Household  Energy  Consumption 

The  purpose  of  the  model  we  employ  to  illustrate  the  nonlinear 
three  stage  least  squares  estimator  is  to  characterize  household  demand 
for  energy.   The  basic  assumption  of  this  model  is  that  each  household 
performs  a  two  stage  budgeting  process:   First,  total  expenditures  are 
allocated  among  all  other  goods  and  an  energy  aggregate;  second,  total 
expenditures  on  energy  are  allocated  among  four  individual  components; 
electricity,  natural  gas,  gasoline  and  all  other  fuels.   Since  the  rela- 
tive prices  of  the  components  under  consideration  are  not  constant  over 
time,  the  existence  of  an  energy  price  aggregate  for  each  household 

requires  homothetic  separability  of  the  four  types  of  energy  and  other 

,   9 
goods. 

As  is  now  standard  in  demand  analysis,  we  begin  with  the  indir- 
ect (dual)  subutility  function  corresponding  to  the  four  types  of  energy: 


Pi  P/ 

V    =  V    f-=^         — —  )  (21) 


where  M   denotes  total  expenditures  on  energy  for  family  k  in  year  t  and 
the  p.  ,  i  =  1,  ...,  A,  denote  the  prices  of  the  four  energy  components. 
We  assume  that  all  relevant  differences  among  households  can  be  parametrized 
by  a  vector  of  characterisitcs  A^  ,  so  that  (21)  can  be  rewritten  as: 
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Vkc   =  V  (—,...  )T^  ,  a.  )   ,  (22) 

.-•- 1  Kt 

where  V  is  c:~«  to  all  househrlcs.   The  relative  demands  for  all  four 
types  ::"  energy  are  found  by  an  abdication  of  Rcy's  (1943)  Identity 
as  : 


:_  ••  .   a  *■*  v 


.--. : 


ikt  In  P^/H^  :I1      3ln  Pqt/\t 


(i  =  1,    ...,    A)    ,         (23) 

•-■-.are  -"_...  15  the  share  of  energv  expenditure  allocated  to  the  ith  con- 
ponent,  i  =  1,  . . . ,  4. 

To  make  the  above  model  operational,  we  postulate  a  translog 
form  for  the  function  V: 


4        p.       4   4         p      p. 
it  it    rit 


=   Z   a,  L-  rr^-  +  ■=■  X       b, ,  in  - —  *n 
let 


J  i=l  j=l  "«  ""  "kt  ""  "kt 


(24) 


+   _   -   b .  A  .  .  in  tz — 

,    is   s.-;t    M.  „ 

1=1  s=_  let 

The  required  homogeneity  of  degree  -1  of  *.  irt:.ies 


I   a,  =  -1  ,    :   b    =  0  ,         ;  =  1,  ..-,  4, 
i=l  i=l   1J 

-    A 

:   b   =  o  s  =  l,  ...,  s. 

i=i   1S 


(25) 
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The  system  of  demand  functions  (23)  can  be  generated  by  atilit 
maximization  under  symmetry,  that  Is,  b. .  «  b. .,  i,j  =  1,  ...,£,  and 
monotonicity  —  for  all  positive  share  values  the  matrix  B  =  (b..)  nst 
be  nonnegative  definite.   Symmetry  can  be  imposed  directly,  while  mono- 
tonicity restrictions  take  the  form  of  inequalities:   Let  B  be  Cholesky 
factorized  as  B  =  LDL' ,  where  L  is  a  unit  lower  triangular  matrix  £ 
D  =  diag  (5  ,  ...,  £  ).   B  is  nonnegative  definite  if  6.        0,  i  =  1, 
...,  4,  where  from  (25),  at  least  one  '£ .  is  zero.   "<.£  i-pose  all  of  the 
homogeneity  and  symmetry  restrictions  at  the  outset  and  impose  the  mono- 
tonicity restrictions  if  they  are  required. 

Application  of  Rcy's  Identity  (23)  to  the  translog  form  (24) 
yields  share  equations  of  the  forn : 


"  Vikt  =  a±  +  X   V  In  P3t  +  s;,  "is  Askt  .  (i  -  1.  -.  **  (26) 


these  are  the  structural  equations  of  the  model;  we  add  =.   stocbastic 

term  E*.  .  : 
lkt 


4  S    s 

-w.,    =  a.  +  I  b..  £n  o.   +  I  b".   A  .   +  s*.    .       (27) 
lkt     l   .  .      li     it     ,   is  skt     lkt 

j=l   J     J  s=l 


We  assume  that  the  vector  e*  =  (e*  )  is  distributed  normallv  with  ~ea~ 

kt  lkt 

zero  and  covariance  matrix  7i*    ,  and  uncorrelated  over  time  and  over  house- 

£ 

holds.   Ve  treat  M.   and  A,   as  exogenous;  therefore, 


28 


we  assume  that  these  variables  are  uncorrected  with  E*t.   Notice  that 

e*  can  be  interpreted  as  representing  omitted  attributes  of  the  indivi- 
kt 

dual  household  which  are  uncorrected  with  the  included  attributes  A^. 

We  can  apply  the  nonlinear  three  stage  least  squares  estimator 
to  the  individual  and  aggregate  versions  of  the  model  (27),  noting  two 
differences  between  our  formal  development  and  that  required  for  this 
example.   First,  average  budget  shares  in  the  population  can  be  replaced 
by  market  budget  shares,  defined  by  weighted  aggregation  of  (27),  using 
total  energy  expenditures: 

7  M.   w  4 

_w  ,  .       ^  ikt  =  a.  +  I  b. .  la  p,t  (28) 

Wit"     ZM^       x   j=1  xi     It 

S  uA  jV  Askt  ,  Z  \t    (£ikt  +  vlkt> 
+  s=l   isT^  EM*t 

where  we  have  added  the  stochastic  term  ?*kt  as  before.   The  vector 

J     =  \v*      1  is  assumed  to  be  normally  distributed  with  mean  zero  and 

vkt    l  iktJ 

covariance  matrix  fl*,  and  uncorrelated  over  households. 

The  aggregate  form  (28)  necessitates  altering  the  error  correc- 
tion transformations  (7)  and  (8)  of  Section  2.   It  is  easily  seen  that 
the  aggregate  share  errors  will  be  -homoscedastic  if  each  equation  at 
time  t  is  multiplied  by  /iT   (previously  S%~) ,   where 

.   -  (I  V2  (29) 

t   ~      2 
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Removing  the  individual-aggregate  error  correlations  is  a  more  difficult 
problem.   Unfortunately,  the  correct  transformation  (previously  (8)) 
utilizes  the  precise  distribution  of  K,       in  each  cross  section  data  base. 
An  alternative  procedure,  which  sacrifices  a  slight  amount  of  efficiency, 
is  to  use  as  aggregate  data  the  market  shares  of  all  individuals  not 
observed  in  the  cross  section  in  each  year.   However,  as  indicated  earlier, 
our  cross  section  observations  represent  such  a  minute  proportion  of  the 
total  population  that  the  aggregate  data  would  be  unaffected  by  removing 
them.   Consequently,  we  ignore  the  individual-aggregate  corre- 
lations. 

The  second  difference  between  the  models  (27)  and  (28)  and  (1') 
and  (2')  arises  from  the  fact  that  both  the  individual  budget  shares 

w.,   and  market  shares  w.   add  to  unity,  i  =  1,  ....  4.   This  implies 
lkt  it 

that  e   and  v   have  singular  distributions,  with  Q     and  fi  singular 
kt      kt  e      v 

matrices.   We  remove  this  problem  by  omitting  one  equation  from  the  model 
for  estimation  (setting  q  =  3) ;  we  can  solve  for  estimates  of  the  para- 
meters of  the  omitted  equation  from  the  estimates  for  the  first  three 

12 
equations.  '   The  error  vectors  of  the  system  of  three  equations  are 

assumed  to  satisfy  Assumption  1. 

We  are  now  in  a  position  to  discuss  the  empirical  application 
in  detail.   As  indicated  above,  our  objective  is  to  analyze  the  alloca- 
tion of  total  expenditure  on  energy  among  four  types  of  energy.   The  basic 
consumer  units  are  households;  we  differentiate  households  amons  five 
demographic  characteristics  —  family  size,  age  of  head,  region,  race, 
and  type  of  residence. 


30 

We  have  represented  these  characteristics  in  the  attribute  vec- 
tor A^  as  qualitative  or  dummy  variables.   The  base  case  household  is 
represented  by  family  size  1,  age  of  head  15-24,  region  Northeast,  race 
White  and  urban  residence.   This  type  of  household  has  shares  given  by 
the  constant  terras. 

For  the  estimation  of  the  model,  we  utilize  five  cross  section 
data  sets:   the  1960-61,  1972  and  1973  Consumer  Expenditure  Surveys  (CES) 
of  the  Bureau  of  Labor  Statistics,  and  the  1973  and  1975  Lifestyle  and 
Energy  Use  surveys  performed  by  the  Washington  Center  for  Metropolitan 

Studies  (WCMS).   Data  on  the  four  energy  categories  are  quite  compatible 

13 
across  these  surveys.  "   There  are  13,098  observations  in  the  1960-61 

CES,  8879  in  the  1972  CES,  8898  in  the  1973  CES,  579  in  the  1973  WCMS 
and  2970  in  the  1975  WCMS,  making  a  total  of  34,424  cross  section  obser- 
vations. 

The  aggregate  price  and  quantity  series  are  obtained  annually 
from  the  National  Income  and  Product  Accounts,  for  the  years  1946  to 
1978.   The  quantity  series  are  taken  as  the  corresponding  components  of 
personal  consumption  expenditures  in  constant  dollar  form,  with  the  price 
series  the  associated  implicit  deflators.   The  statistics  IMA/EM  in  (28) 
are  shares  of  energy  expenditures  for  each  demographic  group.   These  are 
constructed  on  an  annual  basis  following  the  technique  of  Stoker  (1979) 
which  utilizes  income  distribution  data  obtained  from  the  Current  Popula- 
tion Reports,  Series  P-60. 
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Since  we  view  the  demographic  variables  as  exogenous,  we 
require  no  instruments  for  the  cross  section  data  (setting  Z  =  X  respec- 
tively for  each  cross  section),  and  utilize  OLS  on  each  equation  for 

14 
getting  starting  values  and  consistent  estimates  of  fi  .     In  addition, 

in  order  to  compensate  for  slight  differences  among  data  sets,  we  utilize 
only  within-sample   deviation  from  means  moments  and  pool  the  various 
data  sets,  weighting  by  the  within  sample  covariance  matrix  estimates, 
again  sacrificing  a  slight  amount  of  efficiency. 

The  aggregate  price  data  are  viewed  as  endogenous,  and  so  we 
utilize  fourteen  additional  variables  as  well  as  the  attribute-expendi- 
ture statistics  as  instruments.    We  obtain  both  starting  values  and  a 

consistent  estimates  of  0.     via  the  technique  described  above  —  estimat- 

u 

ing  the  demographic  coefficients  with  cross  section  data  and  while  hold- 
ing them  constant,  estimating  the  price  coefficients  with  the  time 
series  data. 

Results  of  the  pooled  estimation  are  presented  in  Table  1.   The 
estimates  were  obtained  by  fitting  the  equations  for  electricity,  natural 
gas  and  other  fuels;  estimates  for  the  gasoline  equation  derived  from 
homogeneity  and  symmetry  restrictions.   We  see  that  most  price  coefficients 
are  estimated  precisely,  except  for  the  coefficients  of  the  price  of  gaso- 
line and  the  own-price  coefficients  of  natural  gas.   This  leads  us  to 
test  the  hypothesis  that  gasoline  and  the  other  fuels  are  separable. 
Also,  it  is  easily  seen  that  the  inequality  restrictions  required  for 
monotonicity  hold,  so  they  need  not  be  imposed. 
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IABLE  1:   Notation 

-=re-de-:  Variables 

. .    ,    lze    0f    electricity frr  lighting,  heating,  use  of  aopliance 

5:  V   ....-■  «3for  heating,  cooking,  heating  water,  and 

-   sraft'ssar—  -  —^  «  ._  plu5  all ::.; 

— niS^«    aeration  of   autonobiles,    la-o  -overs , 
W3AS:  "  ::~:;;"    --"V    vehicles,    and   so   on. 

Predictor  Variables 

trice  of  electricity  , 

PE 

price  of  natural  gas  . 

-    price  of  fuel  oil  and  others, 
PSOL 

price  of  gasoline  . 


constant  (a^  , 

/  n,  s«2e  effects  for  sizes  2,  3,  4,  5,  6,  ' 
F2  F3,  F4,  73,  F6,  77  -  e^-        size  «ft«c 

rz»  -J'    '  or  riore  . 

f  w  effects  for  age  classes  25-34,  35-44, 
,30,  A40,  A50,  -:,  A70  -  £«  headrffec ts  ^  ^ 

..„   -    region  effects  fcr    rtl  =en«als 
,    race  effect  for  Bonvhites, 

m  .    residence  effect  for  rural  residence  . 


weighted  5  — 

f  ur.  c  c  i  o  r. . 


.,  squared  residuals:  -1-  =f  the  ^"^ 


TABLE  1 
Five  Cross  Sections* 


: : 


WE 


rGAS 

'SOL 
PGASO 

c 

F2 
F3 
F4 
F5 

F6 

n 

A30 

A40 

A50 

A60 

A70 

EJC 

BS 

RV 

BLK 

EL'S. 


.16616  (.0300) 
-.05383  (.0182) 
-.12651  (.0150) 
.01418  (.0268) 
-.19601  (.0063) 
.00876  (.0038) 
.0169S  (.0045) 
.00510  (.0049) 
.00114  (.0058) 
.02739  (.0072) 
.01866  (.0076) 
-.04522  (.0056) 
-.03068  (.0061) 
-.03498  (.0057) 
-.02949  (.0057) 
-.06150  (.0056) 
.00845  (.0039) 
-.04511  (.0039) 
.00627  (.0041) 
-.01643  (.0045) 
.01289  (.0036) 


-.05383  (.0182) 
.02768  (.0189) 
.04653  (.0101) 
-.02038  (.0178) 
-.05782  (.0055) 
.01163  (.0034 
.00605  (.0041] 
-.00069  (.0044] 
-.00940  (.0052) 
.00184  (.0065) 
.01868  (.0068) 
-.03043  (.0051 
-.05099  (.0054] 
-.0435S  (.0051) 
-.05042  (.0051) 

-.:£:":  .:::: 

-.07266  (.0035) 
-.03367  (.0035) 
-.05039  (.0037) 
-.08812  (.0040) 
.11150  (.0032) 


*  Asymptotic  standard  errors  ir.  parenthesis 
Convergence  After  3  Iterations 


-. 12551  (.0150) 
.04653  (.0101) 
.10882  (.0188) 
-.02884  (.0136) 
-.17469  (.0051) 
.00579  (.0026) 
-.01072  (.0031) 
-.00272  (.0033) 
-.01298  (.0039) 
-.02756  (.0049) 
-.03521  (.0051) 

-.:.-'■':      :::s 

-.02067  (.0041) 

-.03410  (.0039) 

-.05426  (.0039) 

-.10285  (.0038] 

.09626  (.2026) 

.11663  (.0026) 

.10528  (.0027) 

.00493  (.0038) 

-.11125  (.0024) 

-- 


.  .  *  .  -  - 

m    J  AT  - 


.01418 

.0268 

.02038 

[.0178 

.02884 

.0136 

.03504 

.0292 

.57238 

t.0743 

-.02617 

.  005C 

..01233 

<.0O60) 

■  .::::? 

:::- 

.02:2: 

(.0076) 

-.  ::::: 

(.0095 

-. ::::: 

:::: 

.09050 

.0074 

.10233 

(.0079) 

.1126: 

3075 

.13417 

(.0075 

.25005 

[.0073) 

-.02  225 

.... 

....  — 

-.03783        B051] 

-.06115 

(.0054] 

.09962 

: : : ; 

-.01314 

(.0047) 
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Turning  to  the  demographic  coefficients,  we  see  that  all  are 
significant  at  the  one  percent  level  except  for  family  sizes  2,  4,  5, 
North  Central  and  West  regions  in  the  electricity  equation,  family 
sizes  3  through  7  or  more  in  the  natural  gas  equation,  family  sizes  2 
and  4  and  Nonwhite  the  fuel  oil  equation,  and  family  sizes  4,  6  and  7  or 
more  in  the  gasoline  equation.   The  lack  of  significance  of  these  coeffi- 
cients is  not  a  problem  for  our  model,  since  this  indicates  no  differ- 
ence between  households  of  each  of  these  types  and  the  base  case  family 
of  size  one,  age  of  head  15-24,  region  Northeast,  race  White  and  urban 
residence.   Moreover,  for  all  the  demographic  coefficients  the  standard 
errors  are  quite  small,  so  that  an  insignificant  coefficient  actually 
represents  a  precisely  estimated  small  effect. 

The  price  effects  implied  by  the  estimates  in  Table  1  are  most 
easily  interpreted  in  elasticity  form.   There  are  different  sets  of  price 
elasticities  for  each  type  of  household.   A  complete  display  of  all  own 
and  cross  price  elasticities  for  a  given  price  vector  would  require  com- 
putation of  elasticities  for  each  of  the  672  different  types  of  families. 
Since  this  would  be  very  cumbersome,  we  present  in  Tables  2  A-E  own-price 
elasticities  for  all  four  energy  types,  varying  each  demographic  charac- 
teristic while  holding  prices  at  1972  values.   In  interpreting  the  own- 
price  elasticities  one  must  bear  in  mind  that  they  represent  elasticities 
of  the  energy  types  holding  total  energy  expenditure  constant.   For  example, 
the  effect  of  an  increase  in  the  price  of  energy  on  a  given  family's  con- 
sumption will  only  be  partially  represented  by  our  elasticities,  which 
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TABLE  2 
Own  Price  Elasticities 

A.   Varying  Family  Size 
2  3         A 


7   oi 


llectricity 

latural  Gas 

"uel  Oil  & 
Other 

lasoline 


-1.732991    -1.762442  -1.7Q2333  -1.740P47    -1. 736699  -1.833738  -I. 

-1.254490    -1.284875  -1.269492  -1.252834    -1.234197  -1.7SP799  -1. 

-1.557023    -1.574029  -1.52804*  -1.549385    -1.522322  -1.4««157  -1 . 

-1.074701     -1.0707S4  -1.0727*7  -1.074433    -1.071243  -1.074437  -1. 


15-24 


B.      Varying  Age  of  Head 
25-34  35-44  45-54 55-64  65   or  more 


electricity 

latural  Gas 

?uel  Oil  & 
Other 

Jasoline 


-1.8526*7    -1.692060    -1.736699  -1.722308    -1.740575  -1.64PU4 

-1. 411*47    -1.283502    -1.234197  -1.249*60    -1.235326  -1.181041 

-1.579846    -1.537328    -1.522322  -1.490*83    -1.419793  -1.374565 

-1.063697    -1.076230    -1.078243  -1.080090    -1.084232  -1.116745 


N.E. 


C.  Varying  Region 
N.C.  S 


W 


Electricity 
Natural  Gas 
Fuel  Oil  &  Other 
Gasoline 


-1.736*99  -1.765390  -1.613920  -1.757778 

-1.234197  -1.145039  -1.182277  -1.164196 

-1.522322  -1.970945  -2.186580  -2.055859 

-1.078743  -1.073018  -1.072146  -1.068844 


Electricity 
Natural  Gas 


D.   Varying  Race 
White      Nonwhite 


Varying  Type  of  Residence 
Urban        Rural 


-1.736699  -1.686668 
-1.234197  -1.134174 


Fuel  Oil  &  Other   -1.522322  -1.534976 


Gasoline 


-1.078243  -1.100625 


-1.736699 
-1.234197 

-1.522322 
-1.078243 


-1.781366 
-5.125896 
-1.340497 
-1.076013 
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reflect  substitution  among  energy  types,  but  not  substitution  between 
energy  and  other  goods.   In  general  our  results  indicate  that  families 
are  very  sensitive  to  energy  price  changes,  given  the  level  of  total 
energy  expenditures. 

To  illustrate  tests  of  hypotheses  about  household  demand  for 
energy  using  the  model  (27)  and  (28),  we  test  the  hypothesis  that  gaso- 
line is  separable  from  the  other  energy  components.   This  hypothesis 
arises  naturally  since  energy  is  used  primarily  for  two  purposes:  trans- 
portation and  household  operation  (heating,  lighting  and  running  appli- 
ances).  Under  separability,  the  total  expenditure  allocation  process 
for  each  family  can  be  broken  down  into  three  stages:   first,  the  allo- 
cation of  expenditure  between  other  goods  and  the  energy  aggregate; 
second,  the  allocation  of  energy  expenditures  to  gasoline  and  a  "house- 
hold operation"  aggregate;  and,  finally,  the  allocation  of  household 
operation  funds  to  electricity,  natural  gas  and  other  fuels. 

Separability  of  gasoline  allows  the  function  (22)  to  be  written 
in  the  form: 


p1r   p2t   p3t         p4t 

Tkt  "  v  <VhT  •  TT  ■  TT  •  \,>  •  ITT  •  \t>  <30) 


where  the  index  4  refers  to  gasoline.   Within  the  context  of  the  trans- 
log  function  (24),  necessary  and  sufficient  conditions  for  this  struc- 
ture are: 
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341  =  B.4  =  0  ,  (i  =  1,  ...  4)       (31) 


which  comprise  two  independent  restrictions  given  homogeneity  and  sym- 
metry conditions. 

Within  the  context  of  the  exact  aggregation  model,  failure 
to  reject  separability  implies  that  each  household's  utility  func- 
tion displays  separability  of  gasoline  from  the  other  fuels.   This 
implies  that  a  "household  operation"  aggregate  price  and  quantity  could 
be  constructed  for  each  family.   However,  the  appearance  of  demographic 
differences  in  the  estimated  equations  implies  that  the  aggregate  price 
of  "household  operations"  will  vary  from  household  to  household.   The 
restrictions  implying  a  common  aggregate  price  are  much  stronger,  requir- 
ing all  demographic  effects  to  vanish  from  the  electricity,  natural  gas 
and  fuel  oil  and  other  equations;  these  restrictions  are  certainly  in 
conflict  with  the  significance  of  the  demographic  coefficients  noted 
above. 

We  perform  the  test  of  separability,  as  indicated  in  Section  3, 

by  first  estimating  the  model  subject  to  the  restrictions  (31),  and 

2 
then  comparing  S  (p)  -  S  (6)  to  x  (2)  critical  values,  where  S  (p)  is 

the  restricted  objective  function  value.   Table  3  contains  the  NL3SLS 

estimates  constrained  to  obey  separability.   In  comparison  to  Table  1, 

we  see  that  virtually  all  demographic  coefficients  are  identical.   For 

the  price  coefficients,  we  see  that  those  in  the  electricity  equation 


Table  3 
Pooled  Estimation  Results 
With  Separability  Imposed 
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PE 

.17077  (.0130) 

-.06073 

(.0129) 

PG 

-.06073  (.0129) 

.02364 

(.0183) 

p 

SOL 

-.11004  (.0110) 

.03709 

(.0091) 

PGAS0 

.00000  (.000) 

.00000 

(.000) 

C 

-.19584  (.0063) 

-.05763 

(.0055) 

F2 

.  .00891  (.0038) 

.01174 

(.0034) 

F3 

.01719  (.0045) 

.00624 

(.0041) 

F4 

.00543  (.0049) 

-.00044 

(.0044) 

F5 

.00125  (.0058) 

-.00931 

(.0052) 

F6 

.02737  (.0072) 

.00179 

(.0065) 

F7 

.01842  (.0076) 

.01844 

(.0068) 

A30 

-.04549  (.0056) 

-.03062 

(.0051) 

A40 

-.03222  (.0060) 

-.05210 

(.0054) 

A50 

-.03685  (.0057) 

-.04496 

(.0051) 

A60 

-.03062  (.0057) 

-.05126 

(.0051) 

A70 

-.-6183  (.0056) 

-.08594 

(.0050) 

RNC 

.00868  (.0039) 

-.07252 

(.0035) 

RS 

-.04458  (.0039) 

-.03331 

(.0035) 

RW 

.00647  (.0041) 

-.05026 

(.0037) 

BLK 

-.01614  (.0045) 

-.08792 

(.0040) 

RUR 

.01280  (.0036) 

.11145 

(.0032) 

-.11004  (.0110)     .0000  ( 

.03709  (.0091)     .0000  ( 

.07295  (.0134)     .0000  ( 

.0000  (.000)       .0000  ( 

-.17332  (.0051)  -.57338 

.00581  (.0026)  -.02646 

-.01069  (.0031)  -.01273 

-.00267  (.0033)  -.00232 

-.01296  (.0039)     .02102 

-.02756  (.0049)  -.00159 

-.03624  (.0051)  -.00061 

-.01490  (.0038)     .09099 

-.02094  (.0041)     .10526 

-.03441  (.0039)     .11622 

-.05445  (.0039)     .13633 

-.10292  (.0038)     .25068 

.09630  (.0026)  -.03246 

.11673  (.0026)  -.03884 

.10531  (.0027)  -.06151 

.00498  (.0028)     .09909 

-.11127  (.0024)  -.01298 


000) 

000) 

000) 

000) 

.0109) 

.0050) 

.0060) 

.0064) 

.0076) 

.0095) 

.0100) 

.0074) 

.0079) 

.0074) 

.0075) 

.0073) 

.0051) 

.0051) 

.0054) 

.0059) 

.0047) 


Convergence  after  3  Iterations 
S  (p)  =  85375.61 
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increase  somewhat  in  absolute  value,  with  the  remaining  (unconstrained) 
coefficients  declining  in  absolute  value. 

The  test  of  separability  is  performed  as  follows:  S  (p)  = 

85375.61,  S(9)  =  85366. 44,  so  that  S  (p)  -  S(6)  =  9.17.   The  one  percent 

2 
critical  level  for  a  x  (2)  is  9.21,  so  we  fail  to  reject  separability 

at  the  one  percent  level  of  significance.   With  the  amount  of  data 
employed,  a  one  percent  level  of  significance  is  reasonable;  if  anything, 
it  may  be  insufficiently  stringent.   Further  evidence  of  the  acceptabil- 
ity of  separability  is  given  by  Tables  4  A-E,  where  elasticities  calcu- 
lated from  the  estimates  of  Table  3.   A  comparison  with  Tables  2  A-E 
indicate  that  the  two  sets  of  elasticities  are  almost  identical,  even 
with  the  gasoline  elasticities  constrained  to  -1.00  under  separability. 


TABLE  4 
Own  Price  Elasticities 

A.  Varying  Family  Size 
3  A 
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7  or  mc 


Electricity 

Natural  Gas 

Fuel  Oil  & 
Other 

Gasoline 


■:,«.77,-».77.2»>T,....n.i .,:«»» -...»»«  —  — 

.,',,,4.,    -,.2412,2    -..«""    -1-2  — 8    -1',98iM    ■,-21"69    "'•"' 
./„„„    .,.3.7,12    -1.355744    -..17,444    -,.3,2047    -,.32,-74   ...31, 

./.oooooo  -..oooooo  .,.000000  -oooooo  -uoooooo  -..oooooo-..^ 


Electricity 
Natural  Gas 
Fuel  Oil  &  Other 
Gasoline 


Electricity 
Natural  Gas 
Fuel  Oil  &  Other 
Gasoline 


Electricity 
Natural  Gas 
Fuel  Oil  &  Other 
Gasoline 


15-24 


B.  Varying  Age  of  Head 

25-34  35-A4  45-54. 


55-64        65  or  more 
-,.«W77    -K7U307    -,.752905   -7.7J7M4   -1.758255    -l..«7.. 
.1    353070    -,.242272    -,.,9855,    -,.2,1226    -1.199952    -,.154*0, 
...39,625    M.3,2624    -1.352047    -,.330557    -1.303042    -,.257755 

.,.oo„ooo  -i.oooooo  ...oooooo  -.oooooo  -l.oooooo  -t.oooooo 


N.E. 


C.  Varying  Region 
N.C. §_ 


W 


-1.75290b  -1.782*53  -1.629211  -1.775003 
.1.19B551  -1-123388  -1.1"l«  -1.139611 
-1.352047  -1.657726  -1.80619*  -1.715932 

-l.oooooo  -i.oooooo  -i.oooooo  -i.oooono 


D.   Varying  Race 
White  Nonwhite 


E.   Varying  Type  of  Residence 
U  r  b  an Rural 


-1.752905  -1.702878 

-1.198551  -1.114202 

-1.352047  -1.360708 

-1.000000  -1.000000 


-1.752905 
-1.198551 
-1.352047 
-1.000000 


-1.797952 
-4.113869 
-1.229052 
-1.000000 
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Footnotes 

1.  An  entirely  different  motivation  for  an  exact  aggregation 

procedure  is  obtained  by  making  assumptions  on  the  distribution  of  x 

kt 

for  each  t.   Namely,  one  can  assume  that  (2)  is  the  correct  macro  model, 
and  that  x  is  a  sufficient  statistic  for  the  underlying  distribution. 
Then,  cross  section  regressions  (as  in  the  pooling  procedure)  consistently 
estimate  the  first  derivatives  of  the  macro  function,  i.e.   3(p  ,  6). 
See  Stoker  (1981)  for  the  justification  of  this  approach. 

2.  There  is  a  large  literature  on  pooling  time  series  and 
cross  section  data  in  a  linear  format  —  the  classic  paper  is  Balestra 
and  Nerlove  (1966).   See  also  Maddala  (1971)  and  Mundlak  (1978),  among 
others.   In  fact,  our  procedure  can  be  viewed  as  a  full  pooling  estima- 
tor, with  missing  cross  section  data  for  certain  time  periods.   See 
Stoker  (1978)  for  an  exposition  of  this  point.   Also  it  should  be  noted 
that  most  discussions  of  pooling  time  series  and  cross-section  data  are 
concerned  with  stochastic  structure  and  not  the  structural  model  given 
here.   Issues  regarding  fixed  and  random  effects  as  well  as  heterosced- 
asticity  problems  (see  for  example,  Amemiya  (1978))  can  be  studied  within 
the  framework  of  this  paper  although  we  do  not  explicitly  treat  them. 

3.   For  any  nonsingular  error  structure  there  always  exists  a 
diagonalizing  transformation;  however,  the  transformations  of  greatest 
practical  interest  do  not  Involve  the  distribution  of  x   across  the  popu- 
lation and  allow  the  error  structure  of  the  model  to  be  analyzed  using 
either  the  cross  section  or  time  series  data  base  individually.   Stoker 
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(1978)  gives  an  example  where  individual  errors  are  serieally  correlated 
but  a  simple  transformation  allows  the  correlation  to  be  studied  using 

only  the  average  data. 

A.   In  the  exposition  we  assume  that  the  variance  of  the  dis- 
turbance   conditional  on  the  instrumental  variables  is  constant  for 
both  cross-section  and  average  time  series  models.   If  this  assumption 
is  relaxed,  efficiency  gains  are  possible  by  adjusting  the  weighting 
matrix  of  equations  12  and  13.   See  White  (1980a,  1980b,  1982)  and 
Hansen  (1982)  for  details. 

5.  An  exposition  of  the  Gauss-Newton  method  for  single  equations 
can  be  found  in  Draper  and  Smith  (1966);  this  method  is  discussed  for  sys- 
tems of  nonlinear  regression  equations  by  Malinvaud  (1980). 

6.  If  the  correction  (8)  is  applied  x*  should  replace  x 

o  o 

here. 

7.  Matrix  weighted  averages  are  discussed  in  Chamberlain  and 
Learner  (1976)  and  Mundlak  (1978),  among  others. 

8.  The  model,  data  and  results  are  discussed  in  much  greater 
detail  in  Jorgenson,  Lau  and  Stoker  (1981). 

9.  The  theory  of  multi-stage  budgeting  is  reviewed  in  Black- 
orby,  Primont  and  Russell  (1978). 

10.  For  these  restrictions  see  Jorgenson  and  Lau  (1975). 

11.  In  our  previous  notation,  p  is  the  vector  of  component 
prices  and  x  refers  to  the  constant  and  attribute-expenditure  statis- 
tics at  time  t. 
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12.  Nonlinear  three  stage  least  squares  estimates  are  invari- 
ant to  the  choice  of  omitted  equation. 

13.  Expenditures  on  all  four  components  are  observed  in  each 
data  set  except  the  1960-61  CES,  where  electricity  and  natural  gas  are 
combined.   Utilizing  this  data  requires  only  a  trivial  modification  in 
the  construction  of  the  appropriate  moments  of  M  and  M  .   See  Jorgenson, 
Lau  and  Stoker  (1981)  for  details. 

14.  More  formally,  the  model  in  each  cross  section  is  a  multi- 
variate linear  regression  model  with  respect  to  the  coefficients  6(p  ,  8). 
Best  linear  unbiased  estimation  of  such  a  model  is  equivalent  to  OLS 
applied  to  each  equation. 

15.  We  list  these  instruments  in  the  Appendix. 

16.  The  existence  of  demographic  effects  contradicts  the  existence 
of  a  common  aggregate  price  of  energy  for  all  households. 
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APPENDIX  TABLE 
AGGREGATE  INSTRUMENTAL  VARIABLES 


The  variables  used  as  instruments  for  the  aggregate  time 
series  portion  of  the  model  are  as  follows: 


11  Constant 

12  TL  —  effective  tax  rate,  labor  services 

13  TCR  —  effective  tax  rate,  noncompetitive  imports 

14  LH  —  time  available  for  labor  services 

15  p  —  u.S.  population,  millions  of  individuals 

16  PL  —  implicit  deflator,  supply  of  labor  service 

17  PLG  —  implicit  deflator,  government  purchases  of  labor 
services 

18  EL-HR-RT  —  exogenous  income  ,  which  equals  govern- 
ment transfers  to  persons  (excepting  social  insurance) 
less  personal  transfers  to  foreigners  and  personal  nontax 
payments  to  government 

19  W(-l)  —  private  national  wealth,  lagged  one  period 

110  LH- (1+H)T  —  potential  time  for  labor  services;   H  —  rate 
of  Harrod  neutral  change 

111  Total  imports 

112  PCR  —  implicit  deflator,  noncompetitive  imports 

113  PL/(1+H)T  —  corrected  deflator  for  labor  services 

114  T  —  time,  set  to  0  in  1972 
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